Robust Speech Recognition Features Based on Temporal Trajectory Filtering and Non-Uniform Spectral Compression

نویسندگان

Sang-Ho Lee

Jeong-Hyun Ha

Woo-Young Lee

چکیده

This paper proposes a new feature extraction method based on temporal trajectory filtering and nonuniform spectral compression and examines its performance with two tasks in noisy environments. Temporal trajectory filtering is effective for robust speech recognition in noisy environments, due to human hearing is more sensitive to relative values rather than absolute values and the effect of additive noise which varies slowly may be removed. However, even if noise is stationary, it is not removed exactly due to the random fluctuation. Thus we use non-uniform spectral compression after temporal trajectory filtering and this method shows better performances than the respective methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification of emotional speech using spectral pattern features

Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

Noise-driven Temporal Trajectory Filtering of Spectral Parameters for Robust Speech Recognition

Spectral parameter filtering such as RASTA [5], RASTA-like bandpass filtering and high-pass filtering operates on temporal dynamics of spectral parameters, and has been effective method to reduce channel distortions. Temporal derivatives (delta and delta-delta coefficients, that have proved as robust representation) and spectral mean normalization [4] are also equivalent to filtering that rejec...

متن کامل

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

Robust Speech Recognition Features Based on Temporal Trajectory Filtering and Non-Uniform Spectral Compression

نویسندگان

چکیده

منابع مشابه

Classification of emotional speech using spectral pattern features

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Noise-driven Temporal Trajectory Filtering of Spectral Parameters for Robust Speech Recognition

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

عنوان ژورنال:

اشتراک گذاری